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© Solid phase Immunoassay for an antibody and biological constructions for use therein. 



^© An immunoassay for an antibody comprises: 

< (i) contacting a solid phase, on which is immobilised a first recombinant peptide which presents an 

m antigenic sequence to which the antibody is capable of binding, with a test sample; 
5- v (ii) contacting the solid phase with a second recombinant peptide which presents the said antigenic 
f- sequence, which is labelled and which was expressed in an organism of a different genus than that in which 

the first recombinant peptide was expressed; and 
© (iii) determining whether the test sample contained any said antibody. 

CO 

° A suitable protein for use in assaying for anti-p24 and anti-gp4l HIV-1 antibody is a fusion of a gag 
ftj sequence comprising amino acids 121-356 and an env sequence comprising amino acids 542-674. The amino 
acids are numbered according to Meusing et al. Nature, 313, 450-458 (1985). 
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FIGURE 3 

10 20 3 0 40 M 60 

MOHIOCC CKKMXG tOCKZKJt CAQGTCfcOCC AAAAT7ACQC TATNJTXAC 
>P» >p24 

70 60 90 100 110 120 

AACATCCAGC OXAAATOT ACATCW0CC ATATCACCft OACT3TWA TOCAIOETA 

130 140 ISO 160 170 100 

AAACIWnG AJTOUUK mOOOCCA OAOTGATV CCATOnTIC AOCATTMCA * 

ISO 200 210 220 230 240 

GAAOGWXA CCCCACAAGA TRMMCAOC AT0CTAAACA CAOTQX00C ACATCAAOCA 

250 200 270 200 290 300 

OCCATGCAAA TGTOAAACA CACCATCAAJ GHGAAGCTC CAGAAIQOtt WGWTOCAT 

310 320 330 340 350 360 

CCACTXATB GAOQOOOW TXAOCAQQC OCATOWW3 AACCAAflQQG AAC7TOACATA 

370 360 390 400 410 420 

KAOaACI* CWOaODCT TOOGAACAA AWBATOGA TGACNMTAA TOCACC7JOC 

430 440 430 460 470 460 

ccasinsu; AMcrmnA aagatqgata atcctoot waatmaat aoimgaajc 

490 500 310 520 S30 540 

wnocxm ooocaiict osmmma caaotccaa aagamxttt rama? 

550 560 570 560 590 600 

CTN^OOOCTT 1CTAXMAMC TCTWCW0C AQCAMETT CAOOCWXW AAAMATJQQ 

610 620 630 640 650 660 

ATGACAGAAA OCTJOTWW COAAAJ000 AAC0CAGA3T CTAACACTW TTXAAAAOCA 

670 660 690 700 710 720 

TTO3CMXA0 O0CTACACT AGAAGAAATG ATGACAOCAT QTCA0QGACT GCDVOGkCQC 

730 740 750 760 770 790 

AMTOTCA CACAAHWT CTOTt37!M\ OKKWCAOC ACAAOUTTT OCtGWCXET 
fell 

790 600 610 120 130 140 

URM00E AAOUOOCT GTTOCAACIC lOOKim VCH&MXK QCKOaOk 

950 060 670 MO 690 900 

AGW0CT88 OOTOGAAAO MMOTAAO QWCMOCC 9CCW00Btt TT 0088 H CC 

910 920 930 940 950 960 

1CT0GUAAC tCATTtaCAC CACTQCTOTC 0CTTQSM3G C7WTTOGAG BARAAICT 

970 960 990 1000 1010 1020 

CIOGAACAffl TnOGAAWA OOWOCT05 AIGGMRQ2G ACAOGAAAT TAACAATDC 

1030 1040 1050 1060 1070 1000 

ACAAGCTWA ttCACTOCTT AARCAAGAA TDCCAAAAOC AOnAGAAAA CAAT^A^^ 

1090 1100 1110 1120 1130 1140 

GAATtKJTQB AATWGAWA KT3XOMJT T1UTOCM7T OOmMCBB QGMQX3M " 



1a 



EP 0 307 149 A2 



IMMUNOASSAY AND BIOLOGICAL CONSTRUCTS FOR USE THEREIN 



This invention relates to immunoassays for antibodies and to test kits for use in such immunoassays. 
The most common expression systems for cloning antigens are those in which the antigens are 

* expressed in E. Coli However, E. Coli occurs noticeably as part of the gut flora. Many human sera therefore 
, contain antibodies to E. Coli. When antigens produced in E. Coli are used in sandwich immunoassays for 
* 5 antibody, there is the possibility that some individuals may react with contaminating bacterial Material. Such 

* a reaction may give a false positive result for that particular individual. This type of false positive may be 
minimised by mixing material from E. Coli with a test sample. 

We have now devised a new way of overcoming the problem of contaminating material. In essence, 
immunologically identical antigens are engineered into different organisms. Cloned antigen from different 

io organisms is therefore provided on either side of the "sandwich". Antibody it is not wished to detect in a 
test sample may bind to contaminating material associated with one of the cloned antigens but, because 
they are from different sources, not to contaminants associated with both cloned antigens. False positives 
can thus be avoided. Importantly, it is not then necessary to ensure that the cloned antigens are absolutely 
free of contaminating proteins. The specificity of the assay is greatly improved. The preparation of the 

75 antigens is simplified. 

Accordingly, the present invention provides an immunoassay for an antibody, which immunoassay 
comprises: 

(!) contacting a solid phase, on which is immobilised a first recombinant peptide which presents an 
antigenic sequence to which the antibody is capable of binding, with a test sample; 
20 (ii) contacting the solid phase with a second recombinant peptide which presents the said antigenic 

sequence, which is labelled and which was expressed in an organism of a different genus than that in which 
the first recombinant peptide was expressed; and 

(iii) determining whether the test sample contained any said antibody. 

25 Any antibody may be assayed for in this way. The recombinant peptides must both include the same 
antibody binding site. Detection depends on the fact that antibodies have at least two antigen combining 
sites, two for IgG, IgA and IgE and five for IgM. Taking IgG as an example, one of the antigen combining 
sites binds with the first recombinant peptide which is immobilised on a solid phase. The labelled second 
recombinant peptide binds with the second antigen combining site of IgG. 

30 The recombinant peptides are engineered in organisms of a different genus. Preferably, each organism 
is of a different family. For example, different bacterial expression systems may be used. Alternatively, one 
antigen cloned in a bacterial expression system and one cloned in yeast or in insect or mammalian cells 
may be used. Most preferably, one antigen is cloned in a procaryotic organism whilst the other is cloned in 
a eucaryotic system. Examples of suitable hosts for cloning include B. subtilis, E. Coli, Streptomyces , insect 

35 cells, yeast and mammalian cells. A baculovirus expression system or a vaccinia virus expression system, 
in which peptides are expressed in insect and mammalian cells respectively, are preferred eucaryotic 
alternatives. E. Coli is a preferred procaryotic host. 

The first and second recombinant peptides need not be identical in structure. It is permissible for one to 
be longer than the other. They must both present the antigenic sequence to which the antibody it is wished 

40 to detect is capable of binding, however. In other words, each must include the same antibody binding site. 
By the term "peptide" we include proteins. 

Any appropriate label may be attached to the second recombinant peptide. The label may be an 
enzyme, radioisotope or other reagent which provides colorimetric or fluorometric activity. A preferred label 
is alkaline phosphatase. A cyclic amplification reaction can be initiated using the label. The alkaline 

45 phosphatase reacts with the substrate nicotinamide adenine dinucleotide phosphate (NADP) to form 
nicotinamide adenine dinucleotide (NAD). The NAD is utilised in a cycling reaction involving alcohol 
dehydrogenase, diaphorase and iodonitrotetrazolium violet which leads to a coloured product. 

The immunoassay is particularly suitable for use in determining the presence of HIV antibody. 

* especially HIV-1 antibody, in a sample. Cloned HIV antigens from different sources may be provided. The 
50 antigens may be gag and/or env sequences. HIV-1 provokes in particular two types of antibody. These are 

anti-p24 against the gag protein and anti-gp41 against the env protein. 

Our preferred gag sequence corresponds to amino acids 121-356. This incorporates all of the sequence 
of p24. Our preferred env sequence corresponds to amino acids 542-674. Advantages of this gp4l 
sequence are that it is semi-soluble and that it includes the dominant epitope of the env gene. The 
numbering is according to Mousing et al Nature 313 450-458 (1 985). 
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Preferably, each sequence is fused to 0-galactosidase (/S-gal). This facilitates purification by affinity 
chromatography, using either an anti-galactosidase affinity column or a substrate column. Alternatively, the 
two sequences may be fused together as a gag/env fusion protein. Early seroconversion may be against 
either the gag or the env antigen or both. Desirably, therefore, both the first and the second recombinant 
peptides present both the gag and the env sequences. The gag/0-gal and env/jS-gal fusion proteins may be 
used to achieve this or the gag/env fusion protein may be used on its own. 

A preferred gag/env fusion protein is a protein of the sequence: 

10 20 
MetAsnSerProAspThrGlyHisSerSerGlnValSerGlnAsnTyrProIleValGln 
pl8> P 24> 

30 40 
AsnlleGlnGlyGlnMetValHisGlnAlalleSerProArgThrLeuAsnAlaTrpVal 

50 60 
LysValValGluGluLysAlaPheSerProGluVallleProMetPheSerAlaLeuSer 

70 80 
GluGlyAlaThrProGlnAspLexiAsnThrHetLeuAsnThrValGlyGlyHiBGlnAla 

90 100 
AlaMetGlnMetLeuLysGluThrlleAsnGluGluAlaAlaGluTrpAspArgValHis 

110 120 
ProValHisAlaGlyProIleAlaProGlyGlnMetArgGluProArgGlySerAspIle 

130 140 
AlaGlyThrThrSerThrLeuGlnGluGlnlleGlyTrpMetThrAsnAsnProProIle 

150 160 
ProValGlyGlulleTyrLysArgTrpIlelleLeuGlyLeuAsnLyslleValArgMet 

170 180 
TyrSerProThrSerlleLeuAsplleArgGlnGlyProLysGluProPheArgAspTyr 

190 200 
ValAspArgPheTyrLysThrLeuArgAlaGluGlnAlaSerGlnGluValLysAsnTrp 

iu m 

MetThrGluThrLeuLeuValGlnAsnAlaAsnProAspCysLysThrlleLeuLysAla 

230 240 
LeuGlyProAlaAlaThrLeuGluGluMetMetThrAlaCysGlnGlyValGlyGlyPro 

250 260 
AsnSerProArgGlnLeuLeuSerGlylleValGlnGlnGlnAsnAsnLeuLeuArgAla 
gp41> 

270 280 
IleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlylleLysGlnLeuGlnAla 

290 300 
ArglleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlylleTrpGlyCys 

310 320 
SerGlyLysLeulleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSer 
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330 340 
LeuGluGlnlleTrpAsnAsnMetThrTrpMetGluTrpAspArgGluIleAsnAsnTyr 

350 , 3 f° 

ThrSerLeuIleHisSerLeuIleGluGluSerGlnAsnGlnGlnGluLysAsnGluGln 

5 

370 

GluLeuLeuGluLeuAspLysTrpAlaScrLeuTrpAsnTrpPheAsnGlyAspPro; 

10 optionally modified by one or more amino acid substitutions, insertions and/or deletions and/or by an 
extension at either or both ends provided that a protein having such a modified sequence is capable of 
binding to both anti-p24 and anti-gp41 and there is a degree of homology of at least 75% between the 
modified and the unmodified sequences. 

The unmodified sequence is basically a fusion of parts of the p24 and gp4l proteins of the CBL-1 

76 isolate of HIV-1 (WO 86/04423). These parts correspond to amino acids 121 to 356 and 542 to 674 
respectively. The start of these parts is shown above at amino acids 17 and 244 respectively. Amino acids 
5 to 16 above are derived from the p18 protein. Amino acids 1 to 4, 241 to 243 and 377 to 379 above are 
derived from the expression vector from which the fusion construct was obtained and from DNA manipula- 
tions. 

20 The sequence may be modified by one or more amino acid substitutions, insertions and/or deletions. 
These may occur anywhere in the sequence but especially in the parts of the sequence which are not 
derived from the p24 and gp41 proteins. In the case of substitutions, one or more of the amino acids of the 
unmodified sequence may be substituted by one or more other amino acid which preserves the 
physicochemical character of the sequence, i.e. in terms of charge density, hydrophilicity/hydrophobicity, 

25 size and configuration. For example, Ser may be replaced by Thr and vice versa , Glu may be replaced by 
Asp and vice versa and Gin may be replaced by Asn and vice versa . The"§er residue at amino acid 10 may 
be replacedby Asn. 

The sequence may also be extended on one or both ends. This may be no more than the provision of 
an additional carboxy-terminal Cys residue. However, the sequence may be extended by up to 50 amino 

30 acid residues at either or both ends. Up to 40 amino acids, for example up to 20 amino acids, may 
therefore be added to the amino-terminus and/or carboxy-terminus of the unmodified sequence. Ttie amino- 
terminal amino acid, however, will normally be Met due to the translations start codon of the nucleic acid 
sequence from which the protein is expressed. This is unless the protein has been expressed fused at its 
amino-terminus to a carrier protein and the fusion protein has been cleaved to release the protein of the 

35 invention. 

The sequence may be modified by introducing corresponding changes into the DNA sequence 
encoding the unmodified protein. This may be achieved by any appropriate technique, including restriction 
of the sequence with an endonuclease, insertion of linkers, use of an exonuclease and/or a polymerase and 
site-directed mutagenesis techniques. Whether the modified DNA sequence encodes a modified protein to 

40 which both anti-p24 and anti-gp4l are capable of binding can be readily determined. The modified 
sequence is cloned Into an appropriate plasmid, a host cell is transformed with the plasmid and the protein 
that is expressed is tested for its ability to bind anti-p24 and anti-gp41. Also, there must be a degree of 
homology of at least 75%, for example of 85% or more or of 90% or more, between the amino acid 
sequences of the modified and unmodified proteins. 

45 The fusion proteins gag/0-gal, env/0-gal and gag/env can be expressed in E Coli transformants. Such 
gag//3-gal and env/0-gal fusion proteins or such a gag/env fusion protein can be on one side of a 
"sandwich". Preferably, it is these fusion proteins which are labelled and which constitute the second 
recombinant peptide(s). 

As the source of the gag and env antigens on the other side of the "sandwich", any appropriate 
50 expression system may be used. Preferably, however, a baculovirus expression system is used in which 
the polyhedrin gene incorporates or is replaced by a DNA sequence encoding the antigenic gag and/or env 
sequences. The recombinant peptide is expressed in insect cells. Typically, the host is Spodoptera 
frugiperda. A larger env sequence may be expressed which incorporates at least the amino acids 542-674, 
for example an env sequence corresponding to amino acids 24-750. The peptides cloned using the 
55 baculovirus expression system are preferably the peptides which are immobilised on the solid phase. 

In carrying out the immunoassay, the first recombinant peptide is immobilised on a solid phase. The 
solid phase may be polystyrene beads, plastic microwells, etc. The peptide may be adsorbed onto the solid 
phase or bound there by antibody. A test sample is then brought into contact with the solid phase. A 
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sample of any appropriate physiological fluid may constitute the test sample, for example urine, plasma or 
serum. Antibody in the test sample which is specific for the peptide immobilised on the solid phase binds to 
the peptide. Any antibody in the test sample which is specific for contaminating material derived from the 
host in which the peptide was cloned may bind to such material. 

5 The labelled second recombinant peptide is also brought into contact with the solid phase. This may be 
done after the test sample has been brought into contact with the solid phase. Alternatively, the two steps 
can be effected simultaneously. Antibody in the test sample which binds to the first peptide also binds to 
the labelled second peptide and can thus be detected. Antibody in the test sample which binds to the host- 
derived contaminants associated with the first peptide does not bind to any labelled host-derived con- 

to taminants associated with the second peptide and vice versa . False positives may thus be avoided. 

The immunoassay can be effected qualitatively, i.e. it can be carried out simply to detect the presence 
or absence of a particular antibody in a test sample. Alternatively, it may be effected quantitatively or semi- 
quantitatively to give a measure of how much antibody there is in the sample. 

The materials for use in the immunoassay may be presented in a test kit. Such a test kit typically 

rs comprises: 

(a) a solid phase on which is immobilised a first recombinant peptide which presents an antigenic 
sequence to which the antibody is capable of binding; and 

(b) a second recombinant peptide which presents the said antigenic sequence, which is labelled and 
which was expressed in an organism of a different genus than that in which the first recombinant peptide 

20 was expressed. 

The kit may also include a substrate for the label where the label is an enzyme. Further components 
can be wash fluids and controls. The kit may also contain materials for carrying out a second immunoassay 
alongside the present immunoassay. The second Immunoassay may be for a different antibody, in which 

25 case it may be effected in a similar fashion. 

If the second immunoassay is for the purpose of determining whether a test sample also contains a 
particular antigen, immobilised on the solid phase may be additionally a first antibody capable of binding to 
the antigen. A labelled second antibody for detecting antigen bound to the first antibody is also provided. 
Each antibody may be monoclonal or polyclonal. The two antibodies may be capable of binding to different 

30 sites on the antigen. Where the antigen is polymeric, the antibodies can be capable of binding to the same 
site. The labelled second antibody is brought into contact with the solid phase at the same time as the test 
sample or subsequently. The label may be the same as that on the second recombinant peptide when all 
that is required is an indication whether the test sample contains the antibody and/or antigen being tested 
for. 

35 It is thus possible to provide an immunoassay for both HIV antibody, in particular HIV-1 antibody, and 
hepatitis B surface antigen (HBsAg). Assay for HIV antibody may be effected using the antigens already 
described. Two different antibodies for HBsAg can be employed to assay for the HBsAg, each reacting with 
a different site on the antigen. Each antibody may be monoclonal or polyclonal. Preferably, each is a 
monoclonal antibody so that the solid phase may be contacted with the test sample and the labelled second 

40 antibody simultaneously. Again preferably, the labels on the second recombinant peptide for use in 
detecting the HIV antibody and on the second HBsAg-specific antibody are the same. 

The immunoassay for HIV antibody may be u.sed as a prognostic test. HIV-1 infected subjects initially 
have a fairly constant high titre of anti-p24 antibody when free of AIDS symptoms. On the other hand, a 
lower or falling titre of anti-p24 antibody is significantly associated with clinical progression and provides an 

45 indicator for the onset of AIDS or AIDS-related complex (ARC) (Weber et al (1987) The Lancet, 17 January). 
In order to effect a prognostic test, it is necessary therefore to follow the anti-p24 antibody titre in a 
subject over a period of time. An immunoassay may therefore be conducted using cloned p24 peptides 
obtained from different organisms. Preferred are the p24 constructs discussed above. When the anti-p24 
antibody titre drops, this is considered to be a marker for a poor prognosis. 

50 The present invention also relates to fusion constructs, their preparation and their use in assaying for 
anti-HIV-1 antibody and as a vaccine. 

A variety of assays have been proposed for anti-HIV-1 antibody. However, there are problems with false 
positive and false negative results arising. HIV-1 provokes in particular two types of antibody. These are 
anti-p24 against the gag protein and anti-gp41 against the env protein. We have now prepared a specific 

55 fusion construct to which both anti-p24 and anti-gp41 bind. This enables accurate and sensitive assays to 
be carried out without the risk of false positive or false negative results. 
Accordingly, the present invention provides a protein of the sequence: 
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10 20 
MetAsnSerProAspThrGlyHisSerSerGlnValSerGlnAsnTyrProIleValGln 

pl8> P24> 

30 40 
AsnlleGlnGlyGlnMetValHisGlnAlalleSerProArgThrLeuAsnAlaTrpVal 

50 60 
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LysValValGluGluLysAlaPheSerProGluVallleProMetPheSerAlaLeuSer 

70 80 
GluGlyAlaThrProGlnAspLeuAsnThrMetLeuAsnThrValGlyGlyHisGlnAla 

90 100 
AlaMetGlnMetLeuLysGluThrlleAsnGluGliaAlaAlaGluTrpAspArgValHis 

110 120 
ProValHisAlaGlyProlleAlaProGlyGlnMetArgGluProArgGlySerAsplle 

130 140 
AlaGlyThrThrSerThrLeuGlnGluGlnlleGlyTrpMetThrAsnAsnProProIle 

150 160 
ProValGlyGluIleTyrLysArgTrpIlelleLeuGlyLeuAsnl^ysIleValArgMet 

170 180 
TyrSerProThrSerlleLeuAspIleArgGlnGlyProLysGluProPheArgAspTyr 

190 200 
ValAspArgPheTyrLysThrLeuArgAlaGluGlnAlaSerGlnGluValLysAsnTrp 

210 220 
MetThrGluThrLeuLeuValGlnAsnAlaAsnProAspCysLysThrlleLeuLysAla 

230 240 
LeuGlyProAlaAlaThrLeuGluGluMetMetThrAlaCysGlnGlyValGlyGlyPro 

250 260 
AsnSerProArgGlnLeuLeuSerGlylleValGlnGlnGlnAsnAsnLeuLeuArgAla 
gp41> 

270 280 
IleGluAlaGlnGlnHisLeuLeuGlnLeuThrValTrpGlylleLysGlnLeuGlnAla 

290 300 
ArglleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlylleTrpGlyCys 

310 320 
Se rGlyLysLeulleCysThrThrAlaValProTrpAsnAlaSe rTrpSe rAsnLysSe r 

330 340 
LeuGluGlnlleTrpAsnAsnMetThrTrpMetGluTrpAspArgGluIleAsnAsnTyr 

350 360 
ThrSerLeulleHisSerLeuIleGluGluSerGlnAsnGlnGlnGluLysAsnGluGln 

370 

GluLeuLeuGluLeuAspLysTrpAlaSerLeuTrpAsnTrpPheAsnGlyAspPro; 

optionally modified by one or more amino acid substitutions, insertions and/or deletions and/or by an 
extension at either or both ends provided that a protein having such a modified sequence is capable of 
binding to both anti-p24 and anti-gp41 and there is a degree of homology of at least 75% between the 
modified and the unmodified sequences. 

The unmodified sequence is basically a fusion of parts of the p24 and gp41 proteins of the CBL-1 
isolate of HIV-1 (WO 86/04423). These parts correspond to amino acids 121 to 356 and 542 to 674 
respectively, following a similar numbering system to that of Mousing et al. Nature, 313. 450-458 (1985). 
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The start of these parts is shown above at amino acids 17 and 244 respectively. Amino acids 5 to 16 above 
are derived from the p18 protein. Amino acids 1 to 4, 241 to 243 and 377 to 379 above are derived from the 
expression vector from which the fusion construct was obtained and from DNA manipulations. 

The sequence may be modified by one or more amino acid substitutions, insertions and/or deletions. 

5 These may occur anywhere in the sequence but especially in the parts of the sequence which are not 
derived from the p24 and gp41 proteins. In the case of substitutions, one or more of the amino acids of the 
unmodified sequence may be substituted by one or more other amino acid which preserves the 
physicochemical character of the sequence, i.e. in terms of charge density, hydrophilicity/hydrophobicity, 
size and configuration. For example, Ser may be replaced by Thr and vice versa , Qlu may be replaced by 

10 Asp and vice versa and Gin may be replaced by Asn and vice versa . The Ser residue at amino acid 10 may 
be replacedby Asn. 

The sequence may also be extended on one or both ends. This may be no more than the provision of 
an additional carboxy-termlnal Cys residue. However, the sequence may be extended by up to 50 amino 
acid residues at either or both ends. Up to 40 amino acids, for example up to 20 amino acids, may 
15 therefore be added tp the amino-terminus and/or carboxy-terminus of the unmodified sequence. The amino- 
terminal amino acid, however, will normally be Met due to the translation^ start codon of the nucleic acid 
sequence from which the protein is expressed. This is unless the protein has been expressed fused at its 
amino-terminus to a carrier protein and the fusion protein has been cleaved to release the protein of the 
invention. 

20 The sequence may be modified by introducing corresponding changes into the DNA sequence 
encoding the unmodified protein. This may be achieved by any appropriate technique, including restriction 
of the sequence with an endonuclease, insertion of linkers, use of an exonuclease and/or a polymerase and 
site-directed mutagenesis techniques. Whether the modified DNA sequence encodes a modified protein to 
which both anti-p24 and anti-gp41 are capable of binding can be readily determined. The modified 

25 sequence is cloned into an appropriate plasmid, a host cell is transformed with the plasmid and the protein 
that is expressed is tested for its ability to bind anti-p24 and anti«gp41. Also, there must be a degree of 
homology of at least 75%, for example of 85% or more or of 90% or more, between the amino acid 
sequences of the modified and unmodified proteins. 

A protein of the invention is prepared by a process comprising: 

30 (i) transforming a host cell with a vector which incorporates a gene encoding the protein and which is 

capable, in the host cell, of expressing the protein; 

(ii) culturing the transformed host cell so that the protein is expressed; and 

(iii) recovering the protein. 

35 The gene encoding the protein is preferably constructed in two parts, an amino-terminal part incorporat- 
ing the DNA sequence encoding the p24 amino acid residues and a carboxy-terminal part incorporating the 
DNA sequence encoding the gp4l amino acid residues. The two parts are then fused together and inserted 
in an expression vector. The gene is provided with appropriate transcriptional regulatory sequences and 
translations start and stop codons. The gene is therefore inserted in the expression vector in the correct 

40 reading frame with respect to an ATG start codon under the control of a promoter. 

Any suitable expression vector may be employed, for example a plasmid or a viral vector such as a 
recombinant baculovirus or recombinant vaccinia virus. The host transformed with the expression vector 
may be eucaryotic or procaryotic. for example unicellular microorganisms or mammalian cells. As a 
unicellular eucaryotic host, Saccharomyces cerevisiae , S. kluveromyces and S. pombe may be mentioned. 

46 Strains of bacteria such as E. coli, B. subtilis or B. thermophilus may be used as procaryotic hosts. E coli 
is preferred. The transformed host is cultured and the protein that is expressed is recovered. 

A protein of the invention can be used in assays for anti-HIV-1 antibody, in particular for anti-p24 and/or 
anti-gp41. A test sample of any appropriate physiological fluid may be used in the assay, for example urine, 
plasma, blood or serum. The assay method comprises contacting a test sample with a protein of the 

50 invention and determining whether any anti-p24 and/or anti-gp41 binds to the protein. For this purpose, a 
test kit may be provided comprising a protein of the invention and means for determining whether any anti- 
p24 and/or anti-gp41 there may be in a test sample binds to the protein. 

A variety of assay formats may be employed. The protein can be used to selectively capture anti-p24 
and/or anti-gp41 from solution, to selectively label anti-p24 and/or anti-gp41 already captured, or to both 

55 capture and label. In addition the protein may be used in a variety of homogeneous assay formats in which 
the antibodies which react with the protein are detected in solution with no separation of phases. The 
protein can also be used for HIV-1 antigen detection. 

The types of assay in which the protein is used to capture antibodies from solution involve immobiliza- 
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tion of the protein onto a solid surface. This surface should be capable of being washed in some way. The 
sort of surfaces which may be used are polymers of various types (moulded into microtitre wells; beads; 
dipsticks of various types; aspiration tips; electrodes; and optical devices), particles (for example latex; 
stabilized blood, bacterial or fungal cells; spores; gold or other metallic sols; and proteinaceous colloids; 
5 with the usual size of the particle being from 0.1 to 5 microns), membranes (for example nitrocellulose; 
paper; cellulose acetate; and high porosity/high surface area membranes of an organic or inorganic 
material). 

The attachment of the protein to the surfaces can be by passive adsorption (for which it is ideally suited 
by virtue of its hydrophobic nature) from a solution of optimum composition which may include surfactants, 

to solvents, salts, chaotropes; or by active chemical bonding. Active bonding may be through a variety of 
reactive or activatible functional groups which may be attached to the surface (for example condensing 
agents; active esters, halides, anhydrides; amino, hydroxyl, or carboxyl groups; sulphydryl groups; carbonyl 
groups; diazo groups; unsaturated groups). Alternatively the active bonding may be through another protein 
(itself attached to the surface passively or through active bonding), e.g. through polyclonal or monoclonal 

75 antibody directed against any or all of the epitopes presented by the protein, or through a carrier protein 
such as albumin or casein, to which the protein may be chemically bonded by any of a variety of methods 
and which may confer advantages because of isoelectric point, charge, hydrophilicity or other physico- 
chemical property. The protein may also be attached to the surface (usually but not necessarily a 
membrane) following electrophoretic separation of a reaction mixture e.g. an immune precipitation. 

20 After reacting the surface bearing the protein with a solution contain antibody of interest and removing 
the excess of the sample where necessary by any of a variety of means (washing, centrifugation, filtration, 
magnetism, capilliary action), the captured antibody is detected by any means which will give a detectable 
signal. For example, this may be achieved by use of a labelled molecule or particle as defined above which 
will react with the captured antibody (for example protein A or protein G and the like; anti-species or anti- 

25 immunoglobulin-sub-type; rheumatoid factor; antibodies to any of the epitopes contained in the protein and 
used in a competitive or blocking fashion; or any molecule containing the epitopes of the protein, including 
the protein itself and other proteins and peptides derived directly or indirectly from HIV-1). 

The detectable signal may be optical or radio-active or physico-chemical, provided by directly labelling 
the molecule referred to with for example a dye, radiolabel, electroactive species, magnetically resonant 

30 species or fluorophore; or indirectly by labelling the molecule or particle with an enzyme itself capable of 
giving rise to a measurable change of any sort. Alternatively the detectable signal may be due to, for 
example, agglutination, diffraction effect or birefringent effect occurring if any of the surfaces referred to are 
particles. 

Those types of assay in which the protein is used to label an already captured antibody require some 

as form of labelling of the protein which will allow it to be detected. The labelling can be direct, by chemically 
or passively attaching for example a radio-, magnetic resonant-, particle or enzyme label to the protein; or 
indirect by attaching any form of label to a molecule which will itself react with the protein, e.g. antibody to 
any of the epitopes of the protein, with subsequent reaction of the labelled motcule with the protein. The 
chemistry of bonding a label to the protein can be directly through a moiety already present in the protein. 

40 such as an amino or sulphydryl or through an inserted group such as a maleimide. Capture of the antibody 
may be on any of the surfaces already mentioned, by any reagent, including passive or activated 
adsorption, which will result in specific antibody or immune complexes being bound. In particular capture of 
the antibody could be by anti-species or anti-immunoglobulin-sub-type, by rheumatoid factor, proteins A, G 
and the like, or by any molecule containing any of the epitopes of the protein as described above. 

45 For those assays in which the protein is used to provide a measure of HIV-1 antigen in a sample, the 
protein may be labelled in any of the ways described above, and used in either a competitive binding 
fashion so that its binding by any specific molecule on any of the surfaces exemplified above is blocked by 
antigen in the sample, or in a non-competitive fashion when antigen in the sample is bound specifically or 
non-specifically to any of the surfaces above, in turn binds a specific bi- or polyvalent molecule (e.g. an 

so antibody) and the remaining valances of the molecule are used to capture the labelled protein. 

In general in homogeneous assays the protein and an antibody are labelled, so that, when the antibody 
reacts with the protein in free solution, the two labels interact, for example to allow non-radiative transfer of 
energy captured by one label to the other label, with appropriate detection of the excited second label or 
quenched first label (e.g. by fluorimetry, magnetic resonance or enzyme measurement). Addition of either 

55 antigen or antibody in a sample results in restriction of the interaction of the labelled pair, and so to a 
different level of signal in the detector. 

The protein of the invention may be used in a sandwich immunoassay for anti-p24 or anti-gp41 in which 
recombinant antigens, which present the same antigenic sequence but which have been expressed in 
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different organisms, are present on either side of the sandwich. The most common expression systems for 
cloning antigens are those in which the antigens are expressed in E coil. However, E coll occurs noticeably 
as part of the gut flora. Many human sera therefore contain antibodies to E coli. When antigens produced 
in E. coli are used in sandwich immunoassays for antibody, there is the possibility that some individuals 
5 ma7 react with contaminating bacterial material. Such a reaction may give a false positive result for that 
particular individual. This type of false positive may be minimised by mixing material from E coli with a test 
sample. 

We have now devised a new way of overcoming the problem of false positives due to antibodies to 
contaminating material. In essence, immunologically identical antigens are engineered in different or- 
10 ganisms. Cloned antigen from different organisms is therefore provided on either side of the "sandwich". 
Antibody it is not wished to detect in a test sample may bind to contaminating material associated with one 
of the cloned antigens but, because they are from different sources, not to contaminants associated with 
both cloned antigens. False positives can thus be avoided. Importantly, it is not then necessary to ensure 
that the cloned antigens are absolutely free of contaminating proteins. The specificity of the assay is greatly 
is improved. The preparation of the antigens is simplified. 

In esence, an immunoassay in this format for anti-p24 and/or anti-gp41 antibody comprises: 

(i) contacting a solid phase, on which is Immobilised a first recombinant peptide which presents an 
antigenic sequence to which the antibody is capable of binding, with a test sample; 

(ii) contacting the solid phase with a second recombinant peptide which presents the said antigenic 
20 sequence, which is labelled and which was expressed in a different organism than the first recombinant 

peptide; and 

(iii) determining whether the test sample contained any said antibody. 

The first recombinant peptide and/or the second recombinant peptide is a recombinant protein 
25 according to the invention. The first and second recombinant peptides need not be identical in structure. It 
is permissible for one to be longer than the other, for example. They must both present the antigenic 
sequence to which the antl-p24 and/or anti-gp41 it is wished to detect are/is capable of binding, however. In 
other words, each must include the same antibody binding site(s). 

Two recombinant proteins of different sequences according to the invention or two recombinant proteins 
30 of the same sequence according to the invention may therefore by used. Alternatively, only one of the 
recombinant peptides is a protein according to the invention. In these circumstances, however, the other 
peptide and the recombinant protein according to the invention must present a common epitope. The other 
recombinant peptide may be a gag/env fusion or just a gag or an env peptide. 

Detection of anti-p24 and/or anti-gp41 depends on the fact that antibodies have at least two antigen 
35 combining sites, two for IgG, IgA and IgE and five for IgM. Taking IgG as an example, one of the antigen 
combining sites binds with the first recombinant peptide which is immobilised on a solid phase. The 
labelled second recombinant peptide binds with the second antigen combining site of IgG. 

The recombinant peptides in this assay format are engineered in different organisms. Preferably, the 
genus at least of each organism is different. More preferably, each organism is of a different family. For 
40 example, different bacterial expression systems may be used. Alternatively, one antigen cloned in a 
bacterial expression system and one cloned in yeast or in insect or mammalian cells may be used. Most 
preferably, one antigen is cloned in a procaryqtic organism whilst the other is cloned in a eucaryotic 
system. Examples of suitable hosts for cloning include B. subtilis , E coli, Streptomyces , insect cells, yeast 
and mammalian cells. A baculovirus expression system or a vaccinia virus expression system, in which 
45 peptides are expressed in insect and mammalian cells respectively, are preferred eucaryotic alternatives. E 
coli is a preferred procaryotic host 

A protein of the invention may also be used as a vaccine against HIV-1. For this purpose, the protein 
may be formulated in a pharmaceutical composition with a pharmaceutical^ acceptable carrier or diluent. 
The protein may be presented as an injectable formulation. Suitable diluents include Water for Injections 
50 and isotonic saline solution. The composition is administered parenterally, for example intravenously, 
intramuscularly or subcutaneously. An effective amount is given to a human. Typically a dose of from 10 to 
200ug is given parenterally. 

The following Examples illustrate the invention. In the accompanying drawings: 
Figure 1 shows the preferred env sequence as incorporated in pDM322 in Example 1; 
55 Figure 2 shows the preferred gag sequence as Incorporated in pDM6H in Example 1; 

Figure 3 shows the DNA sequence of the gag/env protein of Example 1 with vector-related 
sequences are shown in bold; 

Figure 4 shows the construction of pFOHc in Example 3; 
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Figure 5 shows the shuttle vector pvFOHC, the sequence of which contains two in-phase initiation 
codons separated by the FMDV VP1 142-160 sequence and six amino acids of the authentic HB "pre-core" 
sequence; 

Figure 6 presents the results of the sandwich EUSA in Example 3; and 
5 Rgure 7 shows the sucrose gradient profile of core reactive material obtained in Example 3. 



Example 1. E. coll expression constructions 

10 

A lambda gt10 (DNA Cloning vol. 1, Editor - D.M. Glover, IRL Press 1985) cDNA library was 
constructed from poly(A + )-RNA from CEM cells infected with the British isolate of HIV (CBL-1) by standard 
techniques (Molecular Cloning (1982) Maniatis et al Cold Spring Harbor Press). Cells of a leukaemic T-cell 
line designated CCRF-CEM which harbour CBM have been deposited at ECACC, Porton Down, QB on 11 

75 January 1985 under deposit number 85 01 1101. 

A lambda recombinant containing the entire envelope gene was identified and the EcoRI insert fragment 
was subcloned into the plasmid vector pUC8 to produce the plasmid pDP4. The exact location of the 
envelope gene within pDP4 was determined by comparison with published data following restriction enzyme 
analysis and partial DNA sequencing (Mousing et al (1985) Nature 313 450-458, Wain-Hobson et al (1985) 

20 Cell 40 9-1 7, Sanchez-Pescador et al (1 985) Science 227 484-492). 

We expressed parts of the HIV genome in E. coli by fusing fragments to the lacZ gene. The expression 
vector chosen was pXY460. This is an Open Reading Frame vector where the strong tac promoter drives a 
mutated lacZ gene in which the initiating ATG and the coding sequence for 0-galactosidase are out of 
frame. The vector is derived from pXY410 (Winther et al J. Immunol. 136 (1986) 1835-1840). There are 

25 restriction sites for EcoRI, Smal and BamHI at the start of tie lacZ gene. Any DNA fragments inserted here 
which can restore the reading frame will produce fusion proteins consisting of the sequence encoded by the 
insert attached to 0-galactosidase. 

EcoRI BamHI 

30 ... ATG AAT TCCCGGG GAT CC lacZ 

Smal 

Vector pXY460, harboured In E. coli HB101, was deposited at NCIMB, Aberdeen, QB on 26 July 1988 under 
_ accession number NCIB 40039. 



a) Envelope 

The region of the HIV-1 env gene coding for amino acids 542-674 was cloned into pXY460 in the 
following manner. A DNA sequence containing a BamHI restriction site was introduced into the gene at 
nucleotide position 8276. The result of this was to add on two amino acids (Gly-Asp) at position 674 and to 
introduce a BamHI site in-frame with the pXY460 vector. 

674 BamHI 

45 env....TGG TTT AAC GGG GAT CC 

...Trp Phe Asn Gly Asp Pro 



There is a site for the enzyme Haelll at position 7875 (amino acid 541) which is in-frame with the Smal 
so site in pXY460. The 403bp Haelll/BamHI fragment was purified by elution from an agarose gel slice and 
ligated with Smal/BamHI digested pXY460. The ligated DNA was transformed into E. coli strain TG1 and 
recombinants were selected as blue colonies on L-Amp-Xgal agar plates. Individual transformants were 
characterised by restriction enzyme digestion and one of those with the predicted pattern of fragments was 
called pDM322. The env sequence incorporated in pDM322 is shown in Figure 1. 
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b) Gag (core) 



10 



15 



The region of the HIV-1 gag gene which encodes amino acids 121-356 was cloned into pXY460 in the 
following manner. 

There is a site for the enzyme Neil [CC(C/G)GG] at position 1180 (amino acid 356) within the gag gene. 

Neil 

. . • GGA GGA CCC GGC 

Gly Gly Pro Gly 

The 5' overhang generated by this enzyme was filled in by incubation with the Klenow fragment of E. 
coli DNA polymerase I in the presence of only dCTP. This end was then in-frame with the Smal site in 
pXY460. Similarly the Pvull site at position 468 (amino acid 121) gave a blunt end in-frame with the Smal 
site in pXY460. 

Pvull 

• • • GCA G CA GCT GAC 

Ala A la Ala Asp 



20 



The Pvull/Ncil(blunted) fragment (710bp) was purified by agarose gel electrophoresis and eiution from a 
gel slice and ligated with Smal-digested, phosphatased pXY460, The DNA was transformed into E coli 
strain TG1 and recombinants selected as blue colonies on L-Amp-Xgal plates. Individual transformants were 
characterised by restriction enzyme digestion and one of those which had the predicted pattern of 
25 fragments was called pDM614. Transformants were also characterised by their immunoreactivity with sera 
from AIDS patients. The gag sequence incorporated in pDM614 is shown in Figure 2. 



c) Gag/env 

30 

The HIV-1 sequences expressed individually in pDM322 and pDM614 were combined together in the 
following manner. In pDM614 a Smal site was retained at the 3' end of the gag insert because the filled-in 
Neil site represents "half" a Smal site (CCCGGG). When the EcoRI site of pDM322 was filled-in using the 
Klenow fragment of E. coli DNA polymerase I, the resulting blunt end was in-frame with the Smal site of 
35 P DM614. " 

DM324 DM322 

qag . . • GCA GGA CCC AAT TCC CCC AGA CAA . . . env 
Gly Gly Pro Asn Ser Pro Arg Gin 

40 

The plasmid DM322 was digested with EcoRI, filled-in and then digested with BamHI; the resulting 
410bp fragment was gel-purified. The plasmid pDM614 was digested with Smal and BamHI and gel 
purified. The two fragments were ligated together and transformed into E. coli TG1. The blue colonies 
45 selected on L-Amp-Xgal plates were analysed by restriction enzyme digestion. One transformant which had 
the predicted pattern of digestion fragments was called pDM624. The 1120bp EcoRI/BamHI fragment from 
pDM624 was also transferred to the plasmid pXY46X, which was derived from pXY460 by deletion of all of 
the lacZ gene and the insertion of in-frame termination codons next to the BamHI site, to produce pDM626. 
The DNA sequence of the gag/env fusion of pDM626 is shown in Figure 3. 

60 

d) Antigen production 

The recombinant E. coli strains were plated onto selective media (L-Amp or L-Amp-Xgal) and single 
colonies used to inoculate overnight cultures (<300ml). Portions of these inocula were added to larger 
volumes of medium (up to 3 litres) in fermentation vessels. The growth of the cultures was monitored and 
when the ODeoo reached 2.0-2.5 the inducer IPTG (isopropyl-0-D-thiogalactopyranoside) was added to a 
final concentration of 5ug/ml. The bacteria were then grown for a further 2-4 hours to allow induction of the 
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recombinant proteins. 

The cells were harvested by centrifugation and resuspended in buffer (25mM tris-CI, pH8.0, 1 mM 
EDTA. 0.2% Nonidet P40) to a final concentration of 100 ODeoo units. Extracts were prepared from the cells 
by one of two methods. 

s i. Lysozyme and PMSF were added to the resuspended cells to final concentrations of 1 mg/ml and 1 

mmol respectively and incubated overnight at 4* C. MgSCU (2mM) and DNasel (40 ug/ml) were added and 
incubation continued at 4*C. More PMSF was added to 2 mM and EDTA was adjusted to 5 mM. The 
extract was clarified by centrifugation at 15000g for 20 minutes. The supernatant was decanted and retained 
at-70 # C. 

io ii. The resuspended cells were passed through a French-pressure cell at an operating pressure of 12- 

15000 psl. PMSF and EDTA were added to the lysate to final concentrations of 2 mM and 5mM 
respectively. The extract was clarified by centrifugation at 15000g for 20 minutes. The supernatant was 
decanted and stored at -70* C. The passages through the pressure cell may be repeated to increase the 
release of antigen from the cells. 

15 

The env/fl-gal and gag//3-gal fusion proteins were purified by affinity chromatography making use of the 
/S-galactosidase enzyme protein engineered into them, either on an anti-galactosidase affinity column or on 
a substrate-affinity column. Following this procedure the antigens may be further purified on a size 
exclusion column, and are then homogeneous by analytical gel-electrophoresis. 
20 The gag/env fusion protein was purified from the insoluble fraction of lysed cells by differential 
extraction with a chaotropic agent (urea). The fraction soluble between 3 and 8M urea was further purified 
by chromatography on a phenyl Sepharose column with elution in 8M urea. The overall yield was greater 
than 70% of antigenic activity and greater than 80% of the protein in bands immunologically reacting as 
fusion proteins subsequent to electrophoresis (SDS-PAQE) and Western blotting. 

25 

Example 2. Baculovirus expression constructions 

The baculovirus Autographa californica nuclear polyhedrosis virus (AcNPV) has been developed into a 
30 useful helper-independent eukaryotic expression vector (Rohmann (1986) J. Gen. Virol. 67 1499-1513, 
Kuroda et al (1986) EMBO J. 5 No.6 1359-1365). The vector makes use of the polyhedrin gene of the virus 
which has two attractive properties. It is highly expressed late in the virus life-cycle. It is not essential for 
virus growth. Foreign genes are introduced into the polyhedrin gene so that they are expressed under the 
control of its promoter and other regulatory elements and the polyhedrin gene is itself inactivated. The 
35 AcNPV genome is very large and cannot be used for direct cloning; instead a transfer vector must be used. 
The transfer vector contains the polyhedrin gene and some flanking sequences, in an E. coli plasmid such 
as pUC8, with convenient restriction sites introduced into the gene. When the transfer vector is introduced 
into insect cells (e.g. Spodoptera frugiperda , FP9) which are also infected with AcNPV recombination can 
occur. The recombinants, which occur at a frequency of less than 1%, have replaced the wild-type 
40 polyhedrin gene with one which now expresses the foreign gene; such viruses are polyhedrin -minus and 
can be identified by their plaque morphology. We have expressed a truncated form of HIV-1 envelope 
protein and the gag/env protein in this system. 

45 a) Envelope 

The Dral site (AATATT) at position 6190 is 65bp upstream of the start of the HIV-1 env gene. This site 
was converted to BamHI by cloning into the vector pUC9 to produce pDM100. 

50 ... TAGGAAAAT ATTAAGACA ....-> GGATCCCCATTAAGAGA 

Dral BamHI 

There is a BamHI site within the env gene at position 8505 which truncates the coding sequence at 
amino acid 752. 

55 

750 BamHI 
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. . . GTG AAC GGA TCC TTA 
Val Asn Gly Ser Leu 



The 2.32Kb BamHI fragment from pDM100, purified by agarose gel electrophoresis as above, was 
ligated with the so-called baculovirus transfer vector pAc373 (Kuroda et al) and transformed into E coli 
strain HB101. A transformant which had the env insert in the correct orientation for expression from the 
poiyhedrin promoter was Identified by restriction enzyme mapping. This was then transfected into SP9 ceils 
infected with AcNPV. Poiyhedrin -minus viruses were identified and plaque purified by standard techniques 
(Summers and Smith "Manual of Methods for Baculovirus Vectors and Insect Culture Procedures" (1986) 
Texas Agricultural Experimental Station; "Current Topics in Microbiology and Immunology", Number 131, 
The Molecular Biology of Baculoviruses, Year: 1986, Editos: W. Doerfler and P. Bohm, Publishers: Springer 
Verlag). 



15 b) Gag/env 

The 1120bp EcoRI/BamHI fragment of pDM626 cannot be expressed directly using the pAc373 vector 
because it does not have its own initiating ATG codon. The EcoRI site at the 5 end of the fragment was 
modified by the addition of a synthetic oligonucleotide which provided an ATG and converted the EcoRI site 
20 to that for BgllL 

5' AATTCAT AGATCT ATG 3' 

3 f GTA TCTAGA TACTTAA 5' 
Bglll 

25 

The gag/env coding sequence was cloned into BamHI-digested pAc373 as a Bglll/BamHI fragment and 
recombinant AcNPV generated as described above. 

30 

c) Preparation of antigens 

The recombinant AcNPV were used to infect fresh cultures of SP9 ceils at a multiplicity of infection of 
1-10. The infected cells were maintained in stirred flasks at 28* C for 36-72 hours, at which time the cells 
35 were harvested and lysed at 2-5 x 10 7 cells/ml with 1% Nonidet NP40 to release the antigen. The antigen 
was clarified by centrifugation at 15000g av for 20 minutes at 4*C. The supernatant was stored at -70 # C 
until required. The antigen may be purified by affinity chromatography on lentil lectin columns. Antigens 
prepared in this fashion are significantly purified over the cell lysate but are not analytically pure. 

40 

Example 3: Vaccinia virus expression constructions 

Two clones were used to construct fusion proteins composed of the major antigenic epitope of foot-and- 
mouth disease virus (FMDV) fused to the amino-terminus of hepatitis B core antigen (HBcAg). One clone 

45 representing HBcAg was obtained from Dr. P. Highfield (pWRL 3123). This clone had been modified at the 
NH 2 terminus such that it could be expressed in bacteria as a fusion protein to the E. coli protein TRP E. 
pWRL 3123, harbored in E. coli HB101, was deposited at NCIB, Aberdeen, GB on 6 March 1987 under 
accession number NCIB 12423. 

A second clone representing FMDV VP1 142-160 sequences from 0i Kaufbeuren linked to the amino 

so terminus of j8-galactosidase was obtained from Dr. M. Winther (pWRL 201) (Winther et al, J. Immunol. 136, 
1835, 1986). Restriction maps of each clone are shown in Figure 4. As can be seen in Figure 4, the junction 
between the FMDV sequence and the /9-gaiactosidase comprises a BamHI restriction she. The strategy 
undertaken therefore involved the fusion of the FMDV sequence and the HBcAg sequence through this 
BamHI site. 

55 The initial stage in the construction therefore involved insertion of a synthetic oligonucleotide linker for 
BamHI at the 5' end of the HBcAg gene of pWRL 3123. The site used for insertion of the linker was the 
Narl site at position 290. However a second Narl site at position 1230 was also present in this plasmid. The 
plasmid was therefore partially digested with Narl so that a population of plasmid molecules which had been 
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cut at only one Narl site could be observed by agarose gel electrophoresis and purified. After flush ending 
the Narl sites using the Klenow fragment of DNA polymerase I, a synthetic oligonucleotide linker 
representing a BamHI site was llgated into the partial Narl digest and the resulting plasmids were used to 
transform E coli. Clones were then analysed for the presence of a BamHI linker in the correct Narl site by 
5 restriction mapping. 

One such clone, designated pEB208, was isolated and DNA prepared. The length of the BamHI linker 
had been specifically chosen so that, when ligated to the FMDV portion of pWRL201 (Fig 4), the 
translations reading frame would be continuous and a fusion protein could be produced. Concomitant with 
the insertion of the BamHI linker, the Narl site into which it had been inserted was destroyed. It was 

io therefore possible to remove the HBcAg sequence from pEB208 by BamHI • Narl digestion whereupon a 
DNA fragment of 940 bases was produced. Similarly a BamHI - Narl fragment from pWRL201 of 
approximately 3.5 kilobases purified. These two fragments were ligated together and the correct clone 
(pFOHc) was identified by restriction mapping. 

As can be seen from Figure 4, pFOHc can be expressed in bacterial cells under the control of the tac 

is promoter. In order to facilitate the transfer of the hybrid gene to a vaccinia virus (W) shuttle vector, 
however, plasmid pFOHc was cut at the single Narl site and a second EcoRI site was introduced as a 
synthetic linker. This enabled the complete hybrid gene to be isolated as an EcoRI fragment. 

The W shuttle vector was pVp11k which was derived from from the vector pH3JAR1A (Newton et al. 
Vaccines 86: New Approaches to Immunisation, Cold Spring Harbor Laboratory, 303-309, 1986) by deletion 

20 of extraneous W sequences. This shuttle vector has a W promoter (in this case p11K) inserted into W 
thymidine kinase (TK) gene. This vector has a unique EcoRI site immediately following the W p11k 
promoter and AUG (Bertholet et al, PNAS, 82, 2096, 1985). The EcoRI site and AUG are in the same 
translation^ reading frame as the amino terminal EcoRI site of the hybrid gene in pFOHc. The FMDV- 
HBcAg gene was therefore inserted as the EcoRI fragment into EcoRI cut dephosphorylated pVp11k. 

25 Clones with the hybrid gene in the correct orientation relative to the p11k promoter were identified by 
restriction mapping. This clone was designated pvFOHc (Figure 5). 

This shuttle plasmid was then inserted into the genome of the Wyeth (US vaccine) strain of W. under 
the control of the p11k promoter, by homologous recombination using the flanking TK sequences. Individual 
progency plaques with a TK"" phenotype were screened for the presence of FMDV-HBcAg DNA by dot blot 

30 hybridisation. 

CV-1 Cell lysates from wild-type (Wyeth) and recombinant (vFOHc) infected cells were screened for the 
presence of core antigen and for FMDV sequences by sandwich ELISA. Antigen from infected cells was 
bound to EUSA plates using either FMD virus particle (146S) or FMD VP1 141-160 antisera raised in 
rabbits. Each trapped antigen was then assessed for the presence of either HBc, FMD 146S or FMD VP1 

35 142-160 epitopes by binding with the respective guinea pig antisera and development with anti guinea pig 
peroxidase conjugate. The results are shown in Figure 6. As can be seen in Fig 6, a protein recombinant 
from (vFOHc) infected cell lysates was trapped with anti-FMDV 141-160 antiserum and this protein could 
then react with anti HBc, anti-FMDV 141-160 and FMDV antivirion serum in a sandwich ELISA. 

Furthermore, this protein could be purified by ultracentrifugation suggesting that it was particulate in 

40 nature. This was illustrated more clearly when the products of centrifugation were sedimented on a sucrose 
density gradient and fractions were re-assayed for the presence of core antigen by ELISA Cell lysates from 
recombinant (vFOHc) vaccinia virus infected celjs or bacteria expressing native core antigen were frac- 
tionated on 15-45% sucrose gradients. Fractions were assayed for the presence of core reactive material by 
indirect sandwich ELISA using human anticore antiserum as trapping antibody and guinea pig HBc antigen 

45 antiserum for detection. The results are shown in Figure 7. The position at which FMD virus sediments is 
also indicated. Fig 7 shows that a peak of HBcAg reactive material was observed in a position similar to that 
observed when core particles expressed in bacteria were centrifuged in parallel. Thus it appears that the 
presence of the FMDV VP1 142-1 so sequence does not interfere with the particulate nature of the core 
particles. 

50 The ability of the fusion protein to self assemble into regular, 27nm core like particles was confirmed by 
electron microscopic examination of immune complexes formed with sucrose gradient purified material. The 
complexes were formed by reacting the FMDV-HBcAg particles with antiserum raised to intact foot and 
mouth disease virus. The complexes were adsorbed to form over coated grids and negatively stained with 
phosphotungstic acid. As was to be expected from the ELISA data shown in Fig 6, immune complexes were 

55 also seen after reacting the particles with antisera to HBcAg or to synthetic FMDV peptide 141-160. 



Example 4: Assay using recombinant HIV antigens 
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a) Preparation of coated microwells 

The antigens obtained from insect cells in Example 2 were coated at a predetermined optimum 
concentration onto the microwells by passive adsorption from an amine-containing buffer at pH 8. The wells 
5 were then supercoated with a solution containing high levels of a bovine protein to ensure that any 
remaining hydrophobic sites are fully occupied. 



b) Preparation of conjugates 

w 

1. The purified antigens from E. coli from Example 1 were labelled with alkaline phosphatase. The 
alkaline phosphatase was attached to the sulphydryl radicals of the fl-galactosidase using well established 
maleimide-sulphydryl chemistry. 

2. The conjugates were freeze-dried in a matrix of a sugar alcohol with serum protein additives, and 
rs reconstituted before use with a diluent containing the metal cofactors of alkaline phosphatase. 



c) Performing an assay 

20 

1. Sequential format. 

A sample to be assayed for the presence of anti-HIV antibodies was added to a microwell and 
incubated for 30 minutes at a temperature of 45* C. The residues were washed from the microwell and then 

25 the conjugate was added to the well and incubated for 30 minutes at a temperature of 45* C. The excess 
conjugate was removed by washing. Then the presence of alkaline phosphatase was detected using the 
cyclic amplification system described previously. Any significant amount of enzyme bound to the well 
indicated the presence in the sample of antibodies to the envelope proteins of HIV. 

The assay was tested using 1662 sera known not to contain antibodies to the envelope proteins of HIV, 

30 and with 6 sera containing such antibodies. The results are shown in Table 1 ("Insect cell cultured 
antigen"), with for comparison an assay performed with the antigens derived from E.coli both as conjugate 
and as coating protein ("E. coli cultured antigen"). It can be seen, that the background colouration is higher 
in the latter case and that there occur samples which give rise to signals which can be ascribed to 
impurities in the antigen preparations and not to the presence of genuine anti-HIV antibodies. These 

35 samples do not give rise to signals on those assays wherein the coating and conjugated- antigens are 
prepared from different sources. 

Table 1 

40 

(a) Backgrounds generally 
Insect cell cultured antigen : assay background = 0.15 1 0.03 O.D. Units 
E. coli cultured antigen : assay background * 0.26 + 0.05 
45 (b) False positive data (ascribable to anti-coli activity) 

E. coli cultured antigen : 2 false positive signals from 1662 samples 
Not reactive on Insect cell cultured antigen 

(c) Positives found as positive using Insect cell cultured antigen 6/6 

50 

Example 5: Detection of antibody against HIV by competition for binding of a labelled antibody itself 
directed to lhe gag/env protein of the invention 

A classic competitive enzyme immunoassay (EIA) was used. The gag/env fusion protein, designated 
55 here 626, obtained in Example 1 was coated onto microtitre wells by capture through a monoclonal 
antibody (TL03) directed against p24. Samples were then added to the prepared wells, and a conjugated 
anti-HIV added immediately. The samples were serum samples and plasma samples from blood donors and 
serum samples from patients with AIDS, AIDS-associated condition and other diseases. The enzyme used 
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in the conjugate was peroxidase. After an incubation period of about 1 hour the wells were washed and 
substrate for the enzyme was added. This was 3,3 I 5,5'"tetramethyl benzidine. Anti-HIV in the sample was 
ascertained by comparison with a standard taken through the procedure. The results are given in Tables 2 
and 3. 

5 

Table 2 



Detection of antibody to HIV in serum samples and plasma samples 






from blood donors 




Centre 


No. samples 


Non-reactive 


Initially 


Repeatably 




tested 




reactive 


reactive 


1 


1699 


1696 


3 


1 (0.06%) 


2 


1783 


1780 


3 


2(0.11%) 


3 


2037 


2034 


3 


1 (0.05%) 


4 


1908 


1904 


4 


2* (0.10-0.16%) 


5 


975 


974 


1 


0 


Total 


8402 


8388 


14(0.17%) 


6 (0.07%) 



" Only 3/4 samples retested 



Table 3: Reactivity of sera from patients with AIDS, AIDS 
25 associated condition and other disease's" ! 

Clinical No. of Antibody Confirmed 

Group Samples Positive Antibody Positive 1 



30 



AIDS 59 59 59 

AIDS related 

complex 62 62 62 

AIDS associated 
35 conditions 6 97 97 97 

High risk d 426 272* 271 

Diseases 
<o unrelated to 

AIDS* 67 l 4 * 0 



45 



50 



b 



55 
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Miscellaneous' 80 0 0 

* Confirmation was by Western Blot and/or at least two 
s alternative immunoassays (except b below). 

b Samples from 30 AIDS patients were confirmed with one 

10 

alternative immunoassay. 

, 5 c includes patients with Persistent Generalised 

Lymphadenopathy, Kaposi's sarcoma, opportunistic infections 
and patients known to be HIV antibody positive. 

20 

d Patients in established risk groups. 

25 

4 Patients with acute viral diseases, autoimmune disease, 
neoplasia. 

30 

* Includes samples from healthy individuals and patients 
with undefined conditions. 

35 

* The discrepant sample (from an IV drug abuser) gave an 
^ uninterpretable Western blot. 

The discrepant sample was grossly haemolysed. 

45 



Example 6: Detection of antibody against HIV by labelling with conjugated anti-globulin 

This is the standard method of detecting antibodies to HIV, 626 from Example 1 was coated onto 
microwells by passive adsorption. Samples of 50ul of serum, pre-diluted 1/100, were added to the wells. 
After an incubation of about 30 minutes the samples were washed out of the wells and 50ul conjugated anti- 
human globulin added. The enzyme used in the conjugate was peroxidase. After incubation for a further 30 
minutes approximately, the wells were again washed and enzyme substrate added. The presence of anti- 
HIV was detected by comparison with a standard taken through the procedure. The results are given in 
Table 4. 
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Table 4 



Indirect antiglobulin test 


Number 
Negatives 
Tested 


Number 
Negative 


Number 
Positives 
Tested 


Number 
Positive 


55 


55 


25 


25 



t 
10 

Example 7: Detection of antibody against HIV by use of labelled 626 

/5 626 from Example 1 was coated passively onto microwells as in Example 6. Undiluted 250ul samples 
were added to the prepared wells. Conjugated 626 (50ul) was added immediately after samples. After an 
incubation for about an hour the wells were washed and substrate added. The enzyme used was alkaline 
phosphatase. The results are shown in Table 5. 

20 Table 5 



Direct sandwich assay 




Number 


Number 


Number 


Number 




Negatives 


Negative 


Positives 


Positive 




Tested 




Tested 




Sera 


810' 


809 


58 


58 


Plasma 


175' 


169 







30 ' 1 false positive sera and 6 false positive plasmas (not all repeatably false 

positive) 



Example 8: Detection of captured antibody against HIV using 626 labelled with a particle 

A standard agglutination test was effected. Latex particles of a diameter of 0.2 micron precoating were 
coated passively with 626 obtained in Example 1. Samples of sera or plasma were mixed with the latex 
either using an apparatus or by stirring on a surface. The presence of antibodies to HIV (which cause 
agglutination of the particles) was ascertained by visual inspection, or by appropriate instrumentation, a few 
minutes after mixing the reagents. The results are shown in Table 6. 

Table 6 



Latex Agglutination: Assay 




Number 


Number 


Number 


Number 




Negatives 


Negative 


Positives 


Positive 




Tested 




Tested 




Sera 


480 


479 


80 


80 | 


Plasma 


50 


50 
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Example 9: Use of 626 to label anti-HlV antibody captured by anti-globulin 

Anti-human globulin was coated passively onto microwells. Samples each of 50ul of undiluted serum 
were added to the prepared wells. 626 from Example 1 labelled with alkaline phosphatase was added 
5 immediately. After an incubation of about one hour the wells were washed and substrate added. The 
presence of anti-HIV antibody was ascertained by comparison with a standard. The results are shown in 
Tables 7 and 8. 

Table 7 



Anti-human capture assay 


Number 
Negatives 
Tested 


Number 
Negative 


Number 
Positives 
Tested 


Number 1 
Positive 


16 


16 


15' 


12 



• includes 3 weak positives not detected 



20 

Table 8 



25 



Anti-human capture assay 


Number 
Negatives 
Tested 


Number 
Negative 


Number 
Positives 
Tested 


Number 
Positive 


138* 


137 


46" 


45 



* 1 false positive 

" very weak positive not detected 



Claims 

1 . An immunoassay for an antibody, which immunoassay comprises: 

40 (i) contacting a solid phase, on which is immobilised a first recombinant peptide which presents an 
antigenic sequence to which the antibody is capable of binding, with a test sample; 

(ii) contacting the solid phase with a second recombinant peptide which presents the said antigenic 
sequence and which is labelled; and 

(iii) determining whether the test sample contained any said antibody; 

45 characterised in that the second recombinant antigen was expressed in an organism of a different genus 
than that in which the first recombinant peptide was expressed. 

2. An immunoassay according to claim 1, wherein one of the recombinant peptides was expressed in a 
procaryotic organism whilst the other was expressed in a eucaryotic organism. 

3. An immunoassay according to claim 2, wherein one of the recombinant peptides was expressed in E 
so coli and the other was expressed in insect cells or mammalian cells by way of a baculovirus or vaccinia 

virus expression system respectively. 

4. An immunoassay according to any one of the preceding claims, wherein the recombinant peptides 
are MIV gag and/or env sequences. 

5. A test kit for use in an immunoassay for an antibody, which kit comprises: 

55 (a) a solid phase on which is immobilised a first recombinant peptide which presents an antigenic sequence 
to which the antibody is capable of binding; and 
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(b) a second recombinant peptide which presents the said antigenic sequence and which is labelled; 
characterised in that the second recombinant antigen was expressed in an organism of a different genus 
than that the first recombinant peptide was expressed. 
6. A protein of the sequence: 

5 

10 20 
MetAsnSerProAspThrGlyHisSerSerGlnValSerGlnAsnTyrProIleValGln 
pl8> p24> 

to 30 40 

AsnlleGlnGlyGlnMetValHisGlnAlalleSerProArgThcLeuAsnAlaTrpVal 

50 60 
Ly sVa lVa 1G1 uGl uLy s Al aPheSe r Pr oGl uVa UlePr oMe tPheSe r Al aLe uSe r 

T5 

70 80 
GluGlyAlaThrProGlnAspLeuAsnThrMetLeuAsnThrValGlyGlyHisGlnAla 

90 100 
20 Al aMe tGl nMe tLe uLy sGl uThrll eAsnGl uGl uAl a Al aGl uTc pAspAr gVa 1 Hi s 

110 120 
Pr ova 1H 1 s Al aGl yPr oil eAl aPc oGl yGl nMe tAr g Gl uPc oAr gGl ySe c AspZl e 

25 130 140 

AlaGlyThrThrSerThrLeuGlnGluGlnlleGlyTrpMetThrAsnAsnProProIle 

150 160 
ProValGlyGluIleTyrLysArgTrpIlelleLeuGlyLeuAsnLysIleValArgMet 



30 



35 



40 



45 



SO 



170 180 
TyrSerProThrSerlleLeuAspIleArgGlnGlyProLysGluProPheArgA8pTyr- 



55 
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190 200 
ValAspArgPheTyrLysThrLeuArgAlaGluGlnAlaSerGlnGluValLysAsnTrp 

210 220 
MetThrGluThrLeuLeuValGlnAsnAlaAsnProAspCysLysThrlleLeuLysAla 

230 240 
LeuGlyProAlaAlaThrLeuGluGluMetMetThrAlaCysGlnGlyValGlyGlyPro 

250 260 
AsnSerProArgGlnLeuLeuSerGlylleValGlnGlnGlnAsnAsnLeuLeuArgAla 
gp41> 

270 280 
IleGluAlaGlnGlnHisLeuLeuGlnLeuThtValTrpGlylleLysGlnLeuGlnAla 

290 300 
ArglleLeuAlaValGluArgTyrLeuLysAspGlnGlnLeuLeuGlylleTrpGlyCys 

310 320 
SerGlyLysLeuIleCysThrThrAlaValProTrpAsnAlaSerTrpSerAsnLysSer 

330 340 
LeuGluGlnlleTrpAsnAsnMetThrTrpHetGluTrpAspArgGluIleAsnAsnTyr 

350 360 
ThrSerLeuIleHisSerLeuIleGluGluSerGlnAsnGlnGlnGluLysAsnGluGln 

370 

GluLeuLeuGluLeuAspLysTrpAlaSerLeuTrpAsnTrpPheAsnGlyAspPro; 



optionally modified by one or more amino acid substitutions, insertions and/or deletions and/or by an 
extension at either or both ends provided that a protein having such a modified sequence is capable of 
binding to both anti-p24 and anti-gp41 and there is a degree of homology of at least 75% between the 
modified and the unmodified sequences. 

7. A process for the preparation of a recombinant protein by 

(i) transforming a host cell with a vector which incorporates a gene encoding the protein and which is 
capable, in the host cell, of expressing the protein; 

(ii) culturing the transformed host cell so that the protein is expressed; and 
(Hi) recovering the protein; 

characterised in that the said gene encodes a protein as claimed in claim 6. 

8. An assay for anti-p24 and/or anti-gp4t HIV-1 antibody, which assay comprises contacting a test 
sample with a protein capable of binding to anti-p24 and/or anti-gp41 HIV-1 antibody and determining 
whether any of the said antibody binds to the protein, characterised in that the protein is a protein as 
claimed in claim 6. 

9. A test kit for use in an assay for anti-p24 and/or anti-gp4l HIV-1 antibody, which kit comprises a 
protein capable of binding to anti-p24 and/or an«-gp41 HIV-1 antibody and means for determining whether 
any of the said antibody in a test sample binds to the protein, characterised in that the protein is a protein 
as claimed in claim 6. 

10. A pharmaceutical composition comprising a pharmaceutically acceptable carrier or diluent and, as 
active ingredient, a protein as claimed in claim 6. 
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30 60 
ATG AAT TCC CCC AGA CAA TTA TIG TCT GGT ATA GTG CAG CAG CAG AAC AAT TTG CTG AGG 
Met Asn Ser Pro Arg Gin Leu Leu Ser Gly He Val Gin Gin Gin Asn Asn Leu Leu Arg 

90 120 
GCT ATT GAG GCG CAA CAG CAT CTG TTG CAA CTC ACA GTC TGG GQC ATC AAG CAG CTC CAG 
Ala He Glu Ala Gin Gin His Leu Leu Gin Leu Thr Val Trp Gly He Lys Gin Leu Gin 

150 180 
GCA AGA ATC CTG GCT GTG GAA AGA TAC CTA AAG GAT CAA CAG CTC CTG GGG ATT TGG GGT 
Ala Arg He Leu Ala Val Glu Arg Tyr Leu Lys Asp Gin Gin Leu Leu Gly He Trp Gly 

210 240 
TQC TCT GGA AAA CTC ATT TGC ACC ACT GCT GTG CCT TGG AAT GCT AGT TGG AGT AAT AAA 
Cys Ser Gly Lys Leu He Cys Thr Thr Ala Val Pro Trp Asn Ala Ser Trp Ser Asn Lys 

270 300 
TCT CTG GAA CAG ATT TGG AAT AAC ATG ACC TGG ATG GAG TGG GAC AGA GAA ATT AAC AAT 
Ser Leu Glu Gin He Trp Asn Asn Met Thr Trp Met Glu Trp Asp Arg Glu He Asn Asn 

330 360 
TAC ACA AGC TTA ATA CAC TCC TTA ATT GAA GAA TCG CAA AAC CAG CAA GAA AAG AAT GAA 
Tyr Thr Ser Leu He His Ser Leu He Glu Glu Ser Gin Asn Gin Gin Glu Lys Asn Glu 

390 420 
CAA GAA TTA TTG GAA TTA GAT AAA TGG GCA AGT TTG TGG AAT TGG TTT AAC GGG GAT CCC 
Gin Glu Leu Leu Glu Leu Asp Lys Trp Ala Ser Leu Trp Asn Trp Phe Asn Gly Asp Pro 

gp41 pXY460 

FIGURE 1 



POOR QUALITY 




pXY460 Gag 



30 60 
ATG AAT TCC OCT GAC ACA GGA CAC AQC AGT CAG GTC AGC CAA AAT TAC OCT ATA GTG CAG 
Met Asn Ser Pro Asp Thr Gly His Ser Ser Gin Val Ser Gin Asn Tyr Pro lie Val Gin 

90 120 
AAC ATC CAG GGG CAA ATG GTA CAT CAG GCC ATA TCA CCT AGA ACT TTA AAT GCA TOG GTA 
Asn He Gin Gly Gin Met Val His Gin Ala He Ser Pro Arg Thr Leu Asn Ala Trp Val 

150 180 
AAA GTA GTA GAA GAG AAG GCT TIC AGC CCA GAA GTG ATA CCC ATG TTT TCA GCA TTA TCA 
Lys Val Val Glu Glu Lys Ala Phe Ser Pro Glu Val He Pro Met Phe Ser Ala Leu Ser 

210 240 
GAA GGA GCC ACC CCA CAA GMT TTA AAC ACC ATG CTA AAC ACA GTG GGG GGA CAT CAA GCA 
Glu Gly Ala Thr Pro Gin Asp Leu Asn Thr Met Leu Asn Thr Val Gly Gly His Gin Ala 

270 300 
GCC ATG CAA ATG TTA AAA GAG ACC ATC AAT GAG GAA GCT GCA GAA TGG GAT AGA GTG CAT 
Ala Met Gin Met Leu Lys Glu Thr He Asn Glu Glu Ala Ala Glu Trp Asp Arg Val His 

330 360 
CCA GTG CAT GCA GGG CCT ATT GCA CCA GGC CAG ATG AGA GAA CCA AGG GGA AGT GAC ATA 
Pro Val His Ala Gly Pro He Ala Pro Gly Gin Met Arg Glu Pro Arg Gly Ser Asp He 

390 420 
GCA GAA ACT ACT AGT ACC CTT CAG GAA CAA ATA GGA TGG ATG ACA AAT AAT CCA CCT ATC 
Ala Gly Thr Thr Ser Thr Leu Gin Glu Gin He Gly Trp Met Thr Asn Asn Pro Pro He 

450 480 
CCA GTA GGA GAA ATT TAT AAA AGA TGG ATA ATC CTG GGA TTA AAT AAA ATA GTA AGA ATG 
Pro Val Gly Glu He Tyr Lys Arg Trp He lie Leu Gly Leu Asn Lys He Val Arg Met 

510 540 
TAT AGC CCT ACC AGC ATT CTG GAC ATA AGA CAA GGA CCA AAA GAA OCT TTT AGA GAC TAT 
Tyr Ser Pro Thr Ser He Leu Asp He Arg Gin Gly Pro Lys Glu Pro Phe Arg Asp Tyr 

570 600 
GTA GAC CGG TTC TAT AAA ACT CTA AGA GCC GAG CAA GCT TCA CAG GAG GTA AAA AAT TGG 
Val Asp Arg Phe Tyr Lys Thr Leu Arg Ala Glu Gin Ala Ser Gin Glu Val Lys Asn Trp 

630 660 
ATG ACA GAA ACC TTG TTG GTC CAA AAT GCG AAC CCA GAT TGT AAG ACT ATT TEA AAA GCA 
Met Thr Glu Thr Leu Leu Val Gin Asn Ala Asn Pro Asp Cys Lys Thr He Leu Lys Ala 

690 720 
TTG GGA CCA GGA GCT ACA CTA GAA GAA ATG ATG ACA GCA TGT CAG GGA GTG GGA GGA CCC 
Leu Gly Pro Ala Ala Thr Leu Glu Glu Met Met Thr Ala Cys Gin Gly Val Gly Gly Pro 

pX*460 



GGG GAT CCC 
Gly Asp Pro 



FIGURE 2 



FIGURE 3 



10 20 30 40 50 60 

MGRWTCOC CTGACACAQG ACACAGCAGT CAGGTCAGCC AAAATIACCC TATAGTGCAG 
>pl8 >P24 

70 80 90 100 110 120 

AACATCCAGG GGCAAATGGT ACATGAGGCC ATATCACCTA GMCTTTAM TQCATGOGTA 

130 140 150 160 170 180 

AAAGTAGTAG AAGAGAAGGC TTTCAGCCCA GAAGTGATAC CCATGTITTC AGCATIATCA 

190 200 210 220 230 240 

GAAGGAGCCA CCCCACAAGA TTTAAACACC ATGCTAAACA CAGTGGGGQG ACATCAAGCA 

250 260 270 280 290 300 

GOCATGCAAA TGTTAAAAGA GACCATCAAT GAGGAAGCTG CAGAATGGGA TAGAGTGCAT 

310 320 330 340 350 360 

CCAGTGCATG GAGGGCCTAT TGCACCAGGC CAGATGAGAG AACCAAGGGG AAGTGACATA 

370 380 390 400 410 420 

QCAGGAACTA CTAGTACCCT TCAGGAACAA ATAGGATGGA TGACAAATAA TCCACCTATC 

430 440 450 460 470 480 

CCAGTAGGAG AAATTIATAA AAGATGGATA ATCCTGGGAT TAAATAAAAT AGTRAGAATG 

490 500 510 520 530 540 

TATAGCCCTA CCAGCATTCT GGACATAAGA CAAGGACCAA AAGAACCTTT TAGAGACTAT 

550 560 570 580 590 600 

GTAGACCGGT TCTATAAAAC TCTAAGAGCC GAGCAAGCTT CACAGGAGGT AAAAAA1TGG 

610 620 630 640 650 660 

ATGACAGAAA O CITGT1G GT CCAAAATGCG AACCCAGATT GTAAGACTAT TTTAAAAGCA 

670 680 690 700 710 720 

TTQGGACCAG CAQCTACACT AGAAGAAATG ATGACAGCAT GTCAGGGAGT GGGAGGACCC 

730 740 750 760 770 780 

AATTCCCCCA GACAATTATT GTCTGGTATA GTGCAGCAGC AGAACAATTT GCPGAGGGCT 
>gp41 

790 800 810 820 830 840 

ATTGAGGCGC AACAGCATCT GTTGCAACTC ACAGTCTQGG GCATCAAGCA GCTCCAGGCA 

850 860 870 880 890 900 

AGAATCCTGG CTGTGGAAAG ATACCTAAAG GATCAACAGC TCCTGGGGAT TPGGGGTTGC 

910 920 930 940 950 960 

TCTGGAAAAC TCATTTGCAC CACTGCTGTG CCTTGGAATG CTAGTTGGAG TAATAAATCT 

970 980 990 1000 1010 1020 

CTGGAACAGA TTTGGAATAA CATGACCTGG ATGGAGTGGG ACAGAGAAAT TAACAATTAC 

1030 1040 1050 1060 1070 1080 

ACAAGCTIAA TACACTCCTT AATTGAAGAA TCGCAAAACC AGCAAGAAAA GAATGAACAA 

1090 1100 1110 1120 1130 . 1140 

GAATIATTGG AATTAGATAA ATGGGCAAGT TTGTGGAATT GGTTIAACGG GGA3CQCDA 
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